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Abstract 

We study a two-user cognitive channel, where the primary flow is sporadic, cannot be re-designed 
and operating below its Unk capacity. To study the impact of primary traffic uncertainty, we propose 
a block activity model that captures the random on-off periods of primary's transmissions. Each block 
in the model can be spht into parallel Gaussian-mixture channels, such that each channel resembles 
a multiple user channel (MAC) from the point of view of the secondary user. The secondary senses 
the current state of the primary at the start of each block. We show that the optimal power transmitted 
depends on the sensed state and the optimal power profile is paranoid, i.e. either growing or decaying 
in power as a function of time. We show that such a scheme achieves capacity when there is no 
noise in the sensing. The optimal transmission for the secondary performs rate splitting and follows a 
layered water-filling power allocation for each parallel channel to achieve capacity. The secondary rate 
approaches a genie-aided scheme for large block-lengths. Additionally, if the fraction of time primary 
uses the channel tends to one, the paranoid scheme and the genie-aided upper bound get arbitrarily 
close to a no-sensing scheme. 

Index Terms 

Cognitive radio, spectrum sensing, interference channel, Gaussian mixture channel, capacity, side 
information, rate sphtting, water-fiUing. 

I. Introduction 

Cognitive wireless is a novel approach to deploy new wireless services in the presence of 
legacy devices with priority access to the channel. The aim of a cognitive framework is to 
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communicate as an underlay to an underutilized primary network without degrading the primary 
communication beyond a predetermined threshold. The key constraint for our formulation is 
that the primary transceiver is legacy but fixed i.e. it was not designed keeping a cognitive 
secondary in mind. This constraint makes the classical interference channel [1] one sided, i.e, Z- 
interference channel with an additional power constraint. In this paper, we analyze how temporal 
opportunities due to sporadic primary traffic can be exploited, even if the secondary transmitter 
has incomplete information about the current state of primary traffic. 

Our contributions are three-fold. First, we approximate the uncertainty in primary activity 
(interference channel with a sporadic primary) by a simple block activity model where the 
primary changes its state at most once during a block of fixed duration T + 1. In this model, the 
two sources of uncertainty in the primary traffic are captured by the initial state Sq and the time 
of state change r. Their actual values are unknown to the secondary, but their distribution 7r(so) 
and fri'T') are known. We first show that the reliability constraint at the primary receiver, which 
requires that primary transmission should not be harmed, places an additional power constraint 
on the secondary transmitter (see for example, [25]) and the interference caused by the sporadic 
primary at the secondary receiver converts the AWGN channel to a Gaussian mixture channel. 

Second, we present two sense-and-send schemes where the secondary senses the channel at 
the start of each block to look for temporal opportunities. The fixed primary design converts the 
effective channel into a MAC. The secondary splits its rate into two layers [20], [19]. We show 
that it is optimal to treat the primary message either as all public information or as all private 
information but not both because the fixed primary's message cannot be spilt into private and 
public parts. We also present a no-sensing scheme and a genie-aided upper bound. We prove 
that for the sense-and-send schemes the secondary power profile is paranoid, i.e. the power 
monotonically increases or decreases during a block depending on whether the primary was 
using the channel or not during the start of the block. The paranoid profile arises due to the 
effective noise of the mixture channels which decays or grows, depending on the starting state 
of the channel in a block. Additionally, we prove that when the sensing is perfect, the paranoid 
scheme is optimal. The proof is along the lines of a channel with delayed state information [22]. 

Third, we show that if there is no information about the primary traffic the rate splitting 
scheme achieves capacity. The effective channel is converted to a MAC with a fixed transmitter 
from the secondary's point of view and hence rate splitting at the secondary transmitter and 
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sequential decoding at the secondary receiver is optimal. Finally, we show that the paranoid 
scheme approaches a genie-aided upper bound for large block lengths and when the primary 
traffic becomes more persistent, both these schemes get close to the no-information scheme. 

Much like any active area, many variations have been proposed and studied in the literature [12, 
and references therein]. Some of the earUer work [16, 18] in opportunistic spectrum allocation 
for cognitive flows was motivated by studies done by FCC [7] showing significant spectral 
underutiUzation. Many aspects of cognitive radio have been studied including coding with 
degraded messages [5, 23], capacity of the secondary flow with causal or non-causal information 
[15], stability of the queues at both the flows for maximal secondary rate with a guaranteed 
primary throughput [21], spectral shaping [25] etc. The case of an adaptive primary which acts 
as a feedback to the cognitive radio is considered in [6]. When interference is considered in 
terms of SINR constraints (see for example, [4,9-11]), the effect of the primary communication 
at the secondary is not considered. Power control schemes have been derived for different power 
constraints [24,25] when the primary is persistent but operating below capacity. The idea of 
opportunistic interference cancellation was introduced in [19]. However all the above results do 
not address the issue of the uncertainty in the channel when the primary traffic is fixed but 
sporadic. We extend the idea of opportunistic interference cancellation to derive sense and send 
schemes for a fixed and sporadic primary operating below capacity. 

The rest of the paper is organized as follows. In Section n, we present the system model and 
introduce the a block activity model for a sporadic primary. In Section EI, we first present a 
genie-aided bound. Then we derive the paranoid secondary profile for a sense and send protocol 
when the sensing is noisy. Finally, we give two special cases when the sensing is noiseless and 
when no sensing is performed. We conclude with some numerical examples. 

II. System Model 

A. Signal Model 

We consider an interference channel (Figure 1), where the channel inputs and outputs are 
related as, Yi = hiiX[ + /i2i-^2 + -^i ^2 = hi2X[ + /i22-^2 + -^2, where X[,X'2 are channel 
inputs, Yi, Y2 are the channel outputs and Zi, Z2 are independent zero mean, unit variance Gaus- 
sian noise. Each transmitter is individually average power constrained, such that < Pi and 
E[X2] < P2. We define the following for convenient interpretation: SNRi = |/iii|^Pi, SNR2 = 
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Fig. 1. A two-flow cognitive interference network. 



|/i22p^[^2^],INRi = |/i2iP^[X^2],INR2 = |/ii2pPi. We have fixed the primary power to Pi 
(primary uses its full power budget) in our definition of SNRi and INR2, leaving the secondary 
power as an optimization variable in our definition of SNR2 and INRi. Finally, we use the 
standard form [2] of interference channel by assuming \hu\ — |/i22| — 1- Thus the channel input- 
output are related as Yi = VSNRT^i + VINRT^2 + ^1 and Y2 = VlNR^Xi + ^SNR^Xa + Z2, 
where the variance of Xi and X2 is bounded above by one. 

B. Block Activity Traffic Model for the Primary Flow 

The primary is assumed to have the following two properties: its channel is underutilized 
i.e. it sends below the capacity of its own channel when it transmits, and its traffic is sporadic 
i.e. it does not occupy the channel continuously. We model the channel underutilization by 
assuming that the primary employs Gaussian random codes with a code rate of C ( ) , 



where INRgap > and C{x) = ^log2(l + x). The sporadic traffic is modeled by assuming 
that the primary transmits for (3 fraction of time, which results in an average primary rate of 



the primary INR does not go above INRgap. We refer to this as the INR constraint. We capture the 
two uncertainties from the secondary's point of view, the start time of the primary transmission 
bursts and the duration of the bursts, using the following block activity model. The primary 
transmissions are assumed to occur in blocks of T + 1 time-slots, where T is a finite constant. 
All transmissions by the primary and the secondary are considered to be slot synchronous. The 




. Thus a secondary user can use the same channel as long as it ensures that 
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primary user activity is labeled as the state of the primary channel and is denoted by St, where 
< t < T is the time-slot index in a block. The state process is independent of the secondary 
channel's input and output. The primary channel is either in the busy state, st = 1, or in the idle 
state, St = during the time-slot t. To keep the analysis tractable, we assume that the primary 
user changes its state only once at time-slot r in a block (see Figure 2). Thus, the tuple (sq, 
captures the two uncertainties related to the sporadic transmissions. 

The starting state of the primary user for each block is assumed to be independently drawn 
from 7r(so) = [7r(0), 7r(l)]. Conditioned on the starting state, the switching time of the primary 
T has a probability mass function, /^^(r), 1 < r < T + 1 and the corresponding distribution 

r 

function Ft{t) = ^ /r(^)- If r = T + 1, then the primary user does not change its state during 

1=1 

the block. The secondary transmitter is assumed to know INRgap, 7r(so) and frit)- The secondary 
receiver is assumed to have perfect knowledge to do coherent decoding. 

;so = 1 T = 6 ; 
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Fig. 2. Simplified block activity model for the primary packets with only one allowed switching of state. This model is 
characterized by the starting state so, the state-switching time r and the blocklength T + 1. 



Since the primary uses Gaussian codes, from the point of view of the secondary the channel 
always behaves like an AWGN channel. The effective noise as seen by the secondary receiver 
has the mixture distribution for 1 < i < T, 

Zit) - I if St = (probability = ^.(t)) 

1 7V(0, 1 + INR2) if St = 1 (probabiUty = P,{t)) 

where, Ps{t) — 1 — Ps{t)- The term /3s{t) is the probability of the state St — 1 conditioned on 
the starting state s, and is used to derive the power profile in Section III. Note that the /3 is the 

actual primary channel usage fraction and (3s{t) is what is seen by the secondary. A persistent 
primary can be represented by putting /3 = 1. If INRgap = 0, the system is fully loaded and no 
secondary is allowed on the same channel. The interesting case is when INRgap > and /3 < 1. 
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C. Interleaved Block Code for the Secondary Flow 

We now define the set of secondary codes over B consecutive blocks, when the secondary 
has an estimate of the starting state sq for each block. Bold face letters represent the vector 
corresponding to the variable for all blocks, e.g. Sq = Sq'^ ■ ■ ■ , Sq^^) represents the starting 
state of all B blocks. Since the time-slot of each block has identically distributed noise as 
given by Equation (1), the secondary designs T different codebooks each matched to a given time 



slot across B blocks. For example, during block the secondary transmits {x\ ' , 
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where x^j^ denotes the z"^ component of a codeword from the codebook. Similarly the t 
codeword is given by xt = {x[^\xf'^ . . . ,x[^^) as shown in Figure 3. 
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Fig. 3. Codewords are multiplexed across B blocks, i.e. each time slot uses a different codeword which spans across B blocks. 
The figure shows the sensing operation at the beginning of each block and the codeword, {x[^\ . . . ,x[^^). Each of the T 
codewords are picked from the corresponding T codebooks. The block size T is a positive constant, however, the code length 
B tends to infinity. 



A cognitive interleaved block code of length N = BT consists of T separate [B, 2^^^^^\ e)- 
codes, corresponding to each time slot of a block. Each of these codebooks further consists 
of two component codebooks corresponding to the sensed starting state of the block sq which 
may or may not match the actual starting state sq. The codebooks are defined by a set of 2T 
encoding functions that map the set of equiprobable messages {!,..., 2^^} to channel inputs. 
The transmitter encoding function is defined as ftso ■ Wx<S— > Xfort — 1,...,T and Sq — 1,2, 
i.e. Xj^Q = /tso(w, So). The decoding function g : xS^ maps the received vector to the 

2BR 

message set, leading to probability of error Pe = i^hi Yl J2pi^o) Yl -Pe(y|f (O^ ^o)- The 
rate R is e-achievable if there exists a cognitive block code for sufficiently large N — BT and 
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Pg < e. A rate R is said to be achievable if there exists an e-achievable code for every e > 
and the capacity is defined to be the supremum of all the achievable rates R. 

Finally consider the Han-Kobayashi scheme for interference channels where each user's data 
is split into private and public parts. The public information is decodable by both decoders and 
the private information is decodable by the intended decoder. A special case of this structure is 
usable in our model. Firstly, since the primary encoder is fixed, its data can be considered either 
public (if the secondary can decode it, i.e. the cross channel can support the rate) or private, 
but it cannot be split into both. Secondly, since the primary decoder is fixed, it considers all the 
secondary information as noise, i.e. private. Since the primary data is fixed, the secondary can do 
rate splitting at the encoder and sequential decoding at the decoder without losing optimality. We 
will call the first layer the single user codeword with power psm since it is decoded using a single 
user decoder (treating everything else as Noise), and the second layer the multiuser codeword 
with power p^st since it is decoded using a multiuser receiver (with Successive interference 
cancellation). The subscript s, s will be replaced by its value (0 or 1) depending on the context. 

III. Main Results 

In the block activity model, the two sources of uncertainty in the primary traffic are captured 
by the initial state sq and the time of state change r. Their actual values are unknown to 
the secondary, but their distribution /t(t) and 7r(so) are assumed to be known. Our aim is to 
understand how lack of knowledge of these two parameters impacts the secondary rate. We first 
derive the case where a Genie provides the information about (so,t) which serves as an upper 
bound and a design motivation for the general case of Section III-B, where the secondary has 
an estimate sq of the initial state sq and has no knowledge of r. We then consider two important 
special cases of the general scheme. First, we consider the case of perfect sensing, sq = Sq, and 
derive stronger results about the optimal secondary transmission design. Second, we consider 
the case where the secondary has no information about any of the unknown parameters, which 
is equivalent to Sq not providing any useful information about Sq. 

A. Genie-aided Case: Secondary has perfect estimate of (sq, r) 

In this section, we assume Genie-aided knowledge of (so,r) at the secondary. As described in 
Section II-B, the primary flow has two states, on and off. Hence the secondary transmitter uses 
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two interleaved codes matched to the two channel states. Such a solution is called water-filling 
or water-pouring [8], as the variance of the two codebooks is inversely proportional to the noise 
variance of the corresponding channel state. We use the following result from [17] to show that 
Gaussian codebooks are optimal for the secondary. 

Lemma 1 (Optimality of Gaussian codewords, see [17, Theorem 1]): For a single-user scalar 
additive noise channel with nearest neighbor decoding irrespective of the noise distribution, the 
average probability of error over the ensemble of Gaussian codebooks of power P, approaches 
zero as the blocklength n tends to infinity for code rates below log(l + P/N) (and approaches 
one for rates above log(l + P/N)). 

Since the primary uses a Gaussian codebook, the equivalent noise at the secondary is also 
Gaussian. Further since the decoder of the primary is fixed, any distribution of noise at its 
receiver does not change the primary capacity as long as the variance (interference plus noise) 
is below INRgap. Hence Gaussian codebooks are optimal for the secondary. The secondary uses 
a Gaussian codebook with power po when the primary is off, and a superposition Gaussian 
codebook with power {piN.pis)- Define INRc = and SICc = |/ii2p (INRgap + 1) - 1 to 
characterize the power allocation for the Gaussian codebook that achieves capacity. 

Theorem 1 (Genie-aided Upper Bound): When SICc > min (SNR2, ENRc), then the optimal 
power allocation {pq, pm, Pis) for the genie-aided case is given by 

(SNR2, 0, SNR2) if SNR2 < INRc 

^- SNR2-/3INRc ^ 0, INRc) if INRc < SNR2 ' 

On the other hand, when SICc < min (SNR2, INRc), then the optimal power allocation (po, Pin, Pis) 
is given by, 

(2, 0, SICc) if SICc < SNR2 < SIC^ 

< (7 + /3a, 7 - Pa, SICc) if SIC'c < SNR2 < SICc + INRc - SICc , (3) 
(^,S, SICc) if SNR2 > SICc + INRc - SICc 

where SICc = SICc + /3INR2, a = SICc + INR2, 7 = SNR2 - /3SICc and S = INRc - SICc. 
Here, P and ^ = 1 — ^ are the on and off fractions of the primary, respectively. 

Proof: The proof is based on two steps. First we use Lemma 1 to show that Gaussian 
codebooks are optimal for the secondary transmissions. Then the optimal power allocation is 
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obtained by solving the following optimization problem. 




) 



+ C(P15) 



(4) 



subject to ^po + PipiN + Pis) < SNR2 




Pl5< |/il2r(l+INRgap)-l. 



The rate achieved is a comer point of the 3 user virtual MAC formed by the primary and the two 
code layers of the secondary. Hence the sequential decoding at the secondary receiver achieves 



As shown in Figure 4, power allocation can be viewed as water-filling with a layer in the 
middle representing the active primary. For the time-slots when primary is off, the secondary 
uses a single-user codebook matched to its own channel capacity. When the primary is on, 
the secondary transmitter splits into a two-layer Gaussian superposition codebook as shown in 
Figure 4 followed by a sequential decoding at the secondary receiver. The sequence of decoding 
at the secondary receiver is as follows. First, Layer 1 is decoded by treating everything else as 
noise, then the primary codeword is decoded by treating Layer 2 as noise and finally Layer 2 is 
decoded interference-free. 

B. The General Case: Secondary with noisy estimate of primary state sq 

When the secondary does not know the channel states st exactly for each time slot, it senses 
the channel in the beginning of each block sq (noisy estimate of sq). The effective channel is a 
Gaussian mixture channel as given by Equation (1). A given time-slot (for the same sensed state) 
has the same channel statistics across different blocks as shown in Figure 3. We take a cue from 
the form of Genie-aided code in Theorem 1 and use a general superposition code. If sq = 0, the 
secondary transmitter sends at a power level po = Pon + Pos-, (where po5 = (Posi, ■ ■ ■ , Post) 
etc.) and if sq — 1, it sends at a power level pi — puv + pis- The above time-dependent 
power allocation exploits secondary's knowledge of the conditional probability distribution of 
effective noise in each time slot caused by the primary. We shall omit the subscript in sq, ^0 
in the summation indices to avoid clutter. The error in the state estimate is characterized by 
the probability of missed detection Vm = P{so = 0\so = 1) and the probability of false alarm 



capacity. Complete details are in Appendix A 
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Fig. 4. Layered water-filling: power allocation can be thought of as filling a layered bucket with water. The presence of the 
primary causes one of the layers to be fixed. The size of the fixed layer (in red) is a function of INR2, the size of Layer 2 
depends on the interference gap INRgap and the rest of the power is put into Layer 1. When the primary is persistent, there 
is only one bucket to fill, as shown here. For a sporadic primary, there is another bucket without any layers for po- For the 
block activity model, each of the time slots across the block look alike with (pjv,ps) replaced by [poNt, post) or {piNt, pist) 
depending on the starting state. 



Vf = P{so = l|so = 0). The exact capacity with noisy state estimate sq is unknown. To derive 
an achievable rate using Gaussian codes for secondary transmissions, we extend the optimization 
problem in (5) by generalizing the rate and the three constraints for each sensed state. 

The average power constraint can be computed by summing the power levels weighted with 
the appropriate probability of occurrences, 



"^{pom + Post) + ^ X^(Piivt + pist) 



T + 1 A^^--'^ --^-^ T+_ 

t=i t=i 

+ y . Z^iPoNt + Post) + 7fr—7 Z^ipiNt + Pist) < SNR2 

t=l t=l 

T 

i.e. Yl ^(^) Yl Pi^\s)Y.^PsNt + Psst)<{T+l)SNR2. (5) 

s6{0,l} se{o,i} t=i 
Similarly the rate has to be calculated for the four possible combinations of (so,So). When 
the primary user is switched 'off or 'on,' the effective secondary channel is AWGN with noise 
M{0, 1) or M{0, 1 + INR2) respectively. When there is no primary on the channel, the secondary 
can support a rate of C (^ 1^°^!./ ) + C(post) = C(po5i + PoNi) and in the presence of the primary, 
the secondary can support a rate of C (^ i^i^^!^lp^^^ + C{posi)- 
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a) So = 0, So = 0; When sq — 0,So — 0, the secondary correctly detects that the primary 
is silent and sends po = {Pon^Pos)- The achievable rate averaged over all possible primary 
switches is given by, 



T 



Mt) (J2^{pos^ + Pom) + E ("^ (l 

\i=l i=t ^ ^ 



Roo — ^ 

= E 

t=i 

T 

= Y,{^oit)Cso + Poit)Csi), 



t=l 

T r- 

(1 - FT{t))C{post + Pom) + Frit) { C 

t=i - 

T 




t=l 

Where C,o = C{psst + Psm), Cn = (c [j^^f^^^^ + C{psst)), h{t) = 1 - Frit) and ^o(i) = 
Frit). Lemma 3 (see Appendix B) was used to change the summation order in step (a). 

b) So — 1, So — I-' In this case, as the secondary correctly detects the primary user and 
sends po = (pon, Pos) and the effective noise at the secondary receiver has a variance of I+INR2 
in the beginning of the block which changes to 1 when the primary user changes its state at 
time-slot t — t. Probability of this event s = 1, is 7r(l). The rate that can be achieved in this 

case is given by, i?n = + Mt)Cso), where ^i(i) = Frit) and ^i(i) = 1 - Frit). 

t=i 

c) So — I, So — 0; Even though the primary user is transmitting, the secondary detects 
that there is no primary packet on the channel, it sends {poN,Pos)- In this case the primary 
user is actually on during the start of the block, the noise that the secondary receiver sees has a 
variance of 1 + |/ii2pSNRi and it changes to 1 when the primary changes its state. Probability 
of the event (sq = 1, sq = 0), is t:{1)Vm- The rate achieved for a given t this case is given by, 

Rio = Y.{Pi{t)Csi + Mt)Cso). 

t=i 

d) So = 0, Sq = 1; In this case, as the secondary detects the primary user and it sends 
(pii, . . . , Pit), but the primary user is actually off during the start of the block, so the noise that 
the secondary receiver sees in the beginning is N2 and it changes to Pi + N2 when the primary 
changes its state. Probability of the event (sq = 0, sq = 1) is 7r{0)VF- The rate achieved for a 

T 

given t is given by, Roi = Y.Woit)^so + Po{t)Csi)- 

t=i 

Adding up all the terms derived above after weighting them with the appropriate probability of 
occurrences, leads to the average rate of R2{p) — 2(r+i) ['^{^){^~T^f)Roo + '^{^){^ — T^m)Rii + 
7r{l)VMRw + A0)VFRoi]- 
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Two additional constraints are required to complete the problem formulation. First the INR 
constraint which is imposed due to the constraint on the maximum noise variance that can be 
experienced at the primary receiver, psm + Psst < INRc, for t E [l.T] and s E {0, 1}. Second, 
the SIC constraint which is imposed to ensure that the primary information can be decoded in the 
presence of the multiuser codeword psst, after the single user codeword psm has been decoded 
out. This gives rise to the condition, C ^jip^^^ < C which is same as psst < SICc, 

for t E [1,T] and s E {0, 1}. Note that these two constraints hold for each state and time slot in 
contrast to the average power constraint which are an average constraint. We solve the following 
optimization problem, to find the optimal power profile. 



1 ^ 
max i?2(p) = ^^-^^ J]7r(s)X^P(s|s) J](^,(t)C,o + ^.WC,i), (6) 

^ ' s s t=l 



where $ is the constraint set defined by the INR, SIC and average power constraints derived 
above. Additionally, we have to consider positivity constraints for all the power variables, i.e. 
PsNuPsSt > for s e {0, l},t E [1,T]. This is with the understanding that the constraints do 
not become infeasible, i.e. r,INRgap, SNR2 > and if |/ii2p(INRgap + 1) < 1, PsSt = 0. We will 
assume complementarity and positivity but not discuss it further due to lack of space. 

Theorem 2 (Monotonicity of the power profile): For the optimization problem given in (6), 
Poivi > P0JV2 > ■ ■ ■ > pInt and p^jvi < P\n2 <■■■< pInt- Additionally, 

(i) If P*oNt = then pl^^^^, p*^^^ = 

(ii) If plj^t = then p*jvi, ■ ■ ■ , Plm-i = 

(iii) If = INRc then . . . , p*^^_^ = INRc 

(iv) If pl^^ = INRc then ptjvt+i> Pmr = INRc 

Proof: See Appendix C. ■ 
Due to the generality of our problem formulation in (6), the exact form of the power distribution 
cannot be derived in closed form. However, Theorem 2 proves a very important result about the 
monotonicity of power allocation across time-slots. The optimal power profile is non-increasing 
in time if the start state is sq — 0. That is the secondary gets paranoid over time since it does 
not know when the primary transmitter will start transmitting. So it is better for the secondary 
to send more power in the initial time-slots and become more conservative as time progresses. 
In contrast, if sq = 1^ then the secondary bets more power in the later time- slots as there is 
a higher chance that the primary will turn off in those slots, thereby creating a better channel 
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for the secondary flow. In the next section, we show that the above result can be significantly 
strengthened for the special case of perfect state estimate sq = sq. 

C. Special Case I: Secondary with perfect estimate of primary state, Sq = sq 

In this section we assume that there is no error in secondary's estimate of Sq. When the starting 
state is perfectly known, each of the T subchannels shown in Figure 3 behave as parallel channels. 
For such a channel capacity can be achieved by sending at a constant power with receiver side 
channel side information [13]. We show below that Gaussian codewords with power levels 

Pst = Psst + PsNt achieves the capacity if the sensing is error-free. 

Theorem 3 (Capacity with perfect sensing, Sq = Sq): With perfect sensing, the capacity for 
the discrete cognitive interference channel is given by. 

Proof: See Appendix D. ■ 

The above result for finite input-output alphabets extends to continuous alphabets such that 

I{X?; Y.^'^U s) = /3,(t)C,o + mC^u E £ Aso)Pst < SNR2, p,* < for t = 1, • • • , T 

soeSt=i ' ' 

and p{X2^\so) ~ J\f{0,pst)- In order to find the power allocation pst, we have to solve an 
optimization problem similar to (5). When sensing is perfect Sq — Sq and P(sj|so) = 1- There is 
no missed detection or false alarm, i.e. Vm — and Vf — 0. So, the rate and all the constraint 
equations for the perfect sensing protocol can be obtained by making these substitutions in (6). 

Theorem 4 (Optimal Layer 2 power): For the optimization problem given in (6) with Vm = 
Vf = and sq = sq, the optimal power of the Layer 2 Gaussian codeword is given by, p^g^ = 
(min (SICc, INRc, SNRa))^ , where SICc, INRc are defined in Theorem 1 and x+ = max{x, 0). 

Proof: See Appendix E. ■ 
Due to the perfect estimate of the primary starting state, the rate splitting done at secondary is 
optimal, which allows us to find the exact value of the Layer 2 codewords. The monotonicity 

properties of the power levels still hold for the perfect sensing case. 

Theorem 5 (Monotonic Layer 1 profile): For the optimization problem given in (6) with Vm ~ 
Vf = 0, > p*jv2 > ■ ■ ■ > PoJVT and p\^^ < pl^^ <...< pInt- Additionally, 
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(i) If P*oNt = then plj^^^^, p^^^ = 

(ii) If p*^, = then pt^i, . . . , p\^^_^ = 

(iii) If p*jvt = INRc then . . . , p^^^.^ = INR^ 

(iv) If p*jvt = INRc then pt;vt+i> ■ ■ ■ > pIatt = INRc 

Proof: The proof follows directly from Theorem 2 for sq — and T'm = = 0. ■ 
The intuition for the above results is as follows. It is always better to allocate more power to 
Layer 2 codewords (while satisfying the power and interference constraints) as it has a higher 
contribution towards the secondary rate R2. For a fixed \hi2\, below a certain INRgap, this upper 
bound is zero and all the power goes to the Layer 1 codeword. As INRgap increases, the proportion 
of the Layer 2 codeword keeps increasing and in the end all the power is put into the Layer 2 
codeword. In short, we have to do layered water-filling for each time slot in the block as shown 
in Figure 4. Next, we show that the opportunistic superposition of [19] is a special case of the 
coginitive protocol when no sensing is done. 



D. Special Case II: Secondary with no information about primary state sq 

Consider the special case when the secondary does not sense the channel at all, i.e. the 
secondary transmitter only knows the statistics of the primary traffic 7r(so) and /t(t). Alternately, 
the estimate sq is so noisy that it does not provide any information about sq. A similar analysis 
of this special case can also be found in [20]. Out of the available power SNR2, ps — q;SNR2 is 
assigned to the Layer 2 codeword and pjv = (1 — a)SNR2 is assigned to the Layer 1 codeword. 



The average power constraint for the secondary is given by pn + Ps < SNR2 and the INR 

INRgaj 



constraint can be written as Pn + Ps < jrri^- After decoding the single user codeword, the 



residual capacity of the channel is given by C ^jq^^R^^- To ensure primary is decodability 
the SIC constraint is given by, Ri<C (j^^) (i.e. a < '^'"'"^Sr^^^"^ )- ^uch an a, a 
secondary rate of .^2(0;) = C ( i+inrz+oSnrz ) "'"^ (q;SNR2) can be achieved. Gaussian codebooks 
are optimal in this case too by Lemma 1. The optimal value of the superposition fraction (a*) 
is one which maximizes R2{(y) while satisfying all the constraints. 

Theorem 6 (No-sensing Capacity): For a cognitive interference channel where the secondary 
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does not sense the channel, the optimal superposition fraction is given by 



a — \ 






if INR2 


^ SNRi 
— l+INR„„p 


SICc 


SNRi 


< INR2 < 


SNR2 


1+INRgap 


1 


if INR2 


^ SNRi 
- 1+INRgap 



1+INRgap 



and the capacity is given by R2[q.*\ 

Proof: See Appendix F. ■ 

From the secondary receiver's perspective, the equivalent channel is a MAC. The fixed primary 
converts the equivalent MAC rate region (pentagon) to a single Une (Figure 5(a)). Even though 
the rate splitting assumes sequential decoding, it turns out to be optimal [3] because the two 
code layers at the secondary, makes the rate tuple a comer point of a three user virtual MAC 
[20] consisting of the two code layers of secondary and the primary. 

Remark 1 (Persistent Primary): If the primary has persistent data (vr(l) = 1), the rate that the 
secondary can achieve is the same as proved above, i.e. the effective channel in the no-sensing 
case is a compound channel and the secondary has to code for the worst case. 

The optimal distribution puts as much available power in pis as possible without violating 
the SIC, INR and power constraints. This can be thought of a layered water-filUng as shown in 
Figure 4. The layer due to the primary codeword is fixed. Available power is first assigned to 
the Layer 2 codeword. When the fixed layer is very high so that no power is left to put in the 
Layer 1 codeword, a* = and if the fixed layer touches the noise floor, all the power is put in 
the Layer 1 codeword, a* = \. The idea of opportunistically doing interference cancellation has 
also been analyzed in [?, 19] and is a restatement of the rate splitting approach introduced in 
[20] for achieving time-sharing without cooperating encoders. The difference from [19] is the 
effect of the secondary transmissions on the primary which gives rise to the power profile and 
the proof of optimality using rate splitting and sequential decoding. 

Figure 5(b) shows a detailed view of the the different rate splitting regions as a function of 
INRgap and INR2, for fixed SNRi, SNR2. In Region 1, the secondary user treats the primary data 
as noise {a* — 0). In region 2 the secondary uses both layers. In Region 3 the secondary receiver 
first decodes the primary's data and then decodes its own {a* — 1). The Layer 1 protocol region 
is always the same w.r.t. INRi. The increase in the Layer 2 protocol region with the increase in 
the |/i2i|^ can be attributed to the interference constraint at the primary receiver. This increase 
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Only Layer 1 



Both 



Only Layer 2 



R2 R22 




(a) Effective MAC (as seen by the secondary receiver) 



(b) Detailed view of the rate splitting regions 



Fig. 5. (A) The effective MAC as seen by the secondary receiver, where R\ = C{ i^'^snr2 ) decodable primary rate as 

seen by the secondary receiver, R\2 = C(INR2) is the rate supported by the cross channel between the primary transmitter and 
the secondary receiver, i?2(a) is the rate achievable by the secondary and R22 = C(SNR2) is the maximum rate achievable by 
the secondary if there is no primary. (B) Detailed view of the regions of operation for the secondary user for SNRi = SNR2 = 7 
as a function of the two variables INRgap and INR2. The secondary user uses Layer 1 codewords in Region 1, Layer 2 codewords 
in Region 3 and a superposition code with both layers in Region 2. Note that INRgap = is not a part of any region. 



comes at the expense of the superposition protocol region. When |/i2iP increases, the power 
available to the secondary user keeps on decreasing due to the interference constraint. This 
decreases the power left for the Layer 1 codeword. Hence at higher |/i2ip, a part of the mixed 
protocol region gets converted to a multiuser protocol region as there is no more power left to 
put into the single user codeword. 

The paranoid power profile is optimal if the sensing is perfect. However, even if the sensing 
is noise free, the time spent in sensing is still an overhead. This overhead can be high enough 
for the no-sensing scheme to outperform the perfect sensing scheme in some regimes, as shown 
in Figure 6. Along the curve, points close to the right side (i?2 = 0) are obtained for INRgap = 
and the points to the left side = 0) are obtained for INRgap = oo. Figure 7 plots the two 
extreme rate points on the y-axis in Figure 6 (where Ri = 0) for different block sizes. The 
above-mentioned loss due to sensing, decreases as T increases. Finally, the bounds are closer 
together when the residual capacity of the primary channel is smaller as shown in Figure 8. 
Different values of /3 has no effect on the no-information lower bound. But the genie-aided 
upper bound is able to get a higher rate if the channel is idle for a longer period of time. This 
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Fig. 6. The no-information lower bound can outperform perfect sensing sclieme witli a finite sensing overhead depending on 
the INRgap, the blocklength T. Here, Pi = P2 = 7,^ = 0.5,7r(0) = 7r(l) = 0.5, |/ii2p = |/i2ip = 0.5 and T = 10. 



means, when (3 is high, the performance of both these bounds are close. 




Blocklength (T) 



Fig. 7. The maximum R2 achieved (two extreme rate points on the y-axis in Figure 6 where _Ri = ) by the paranoid 
profile scheme gets closer to the genie-aided upper bound as T increases which is due to smaller sensing overhead. Here, 

Pi = P2 = 7,/? = 0.5,7r(0) = 7r(l) = 0.5, |fti2p = |/i2i|^ = 0.5. 



IV. Conclusion 

We approximate an interference channel in the presence of a fixed sporadic primary flow 
as a block activity channel. Such a block activity model can be broken into parallel channels 
in time and multiplexed codebooks can be used when there is perfect sensing. We derived a 
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1.6 




1 



Fig. 8. Comparison of no-sensing schieme and the Genie-aided sciieme for different persistence (/3) of the primary user's data 
and jfti2p = \h2i^ ~ 0.5. The dashed lines are for the no-sensing scheme and the solid lines are for the genie-aided scheme. 



paranoid scheme for the secondary user when the primary user is transmitting below capacity 
and show that rate splitting at the secondary transmitter with sequential decoding is optimal 
when the estimation of the primary starting state is noise-free. Depending on the starting state 
of the primary during a block, the optimal power profile for Gaussian inputs is either growing or 
decaying in power as a function of the time slot. If the sensing overhead is considered, we showed 
that the paranoid scheme approaches the genie-aided scheme for large block lengths. Finally we 
show numerically that the paranoid and genie aided schemes approach the no-information scheme 
when the primary channel is operating close to its capacity. 

Appendix A 

Proof of Theorem 1 : We derive a special case of the water- filling result [8] for a channel with 
two states, which will be used to prove the optimal power allocation for the genie-aided case. 
Lemma 2: For the optimization problem given by 

max ^C(po) + /3C(-^) 

subject to /3po + I^Pi < 7 and pi < 5 (7) 

with non-negative parameters (a,/3,7,5), the maximizing po,Pi is characterized as follows. 



19 



;^,o) ifp^<o<5 

(Po,Pi) = < (p*,p*) if0<pl<5 

if0<5<pl 
where Po = 7 + /3a, Pi = 7 — /3a; and j3 = 1 — j3. 



(A) 
(C) 



Proof: Let us start with assuming that there is no constraint on pi. The optimal solution 
is given by Po = {j — l)^ 'Pi ^ (i ~ -'^ ~ '^)^' where A is chosen such that /3po + /3pi = 7. 
If ^ > 1 + a, then ^ = 7 + /3 + + a). Therefore, Po = 7 + /3a and p* = 7 - /3a. Note 
^ > 1 + a also ensures p* > and by assumption p* < 5. This proves part B of the Lemma. 

If ^ < 1 + a, Pi = and A is such that /3po = 7, then po = ^,pi = 0. This proves part A 
of the solution. As long as pi lies below 5 (i.e. the extra constraint on pi is not active at the 
solution), the same solutions hold. If pi > 6, pi = 6, po = ■ 

The constraint set defined by the average power constraint at the secondary transmitter, the 
INR constraint at the primary receiver, and the SIC constraint at the secondary receiver is 
schematically shown by the three colored planes in Figure 9. 



Pis SNR2 



Pis = SlCr 




IN + Pis = INRc 



/3po + (3{piN + Pig) ¥ SNRa^^ 



Fig. 9. General Constraint set for tiie perfect sensing optimization problem. 



First consider the case when the SIC constraint is not active i.e. when SICc > min (SNR2, INRc). 
In this case, the solution lies on the po— P15 plane of Figure 9. This is because R2ipo, Pin, Pis) < 
R2iPo, PiN — f^, Pis + f^) as long as K < p^v, i.e. we can achieve a higher rate by taking away 
some power from puy and giving it to pis as long as all the constraints are satisfied. Therefore, 
to maximize the rate, all the available power is allocated to pis since there is no SIC constraint. 
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When we put pi^r = 0, the optimization problem looks like Equation (7) with a — — SNR2 
and 5 = INRc. By using Lemma 2, the solution is given by Equation 2. 

Finally, if SICc < min (SNR2, INRc), the SIC constraint is active at the solution and the 
solution lies on the SIC constraint plane of Figure 9. In contrast to the above cases, we cannot 
put all the power from puv in pis. We increase pis till pis — SICc which equals the SIC 
constraint. Now the problem is same as Equation 7 with a — |/ii2p(l +INRgap + SNRi) — 1,7 = 
SNR2 — /5SICc and 5 — INRc — SICc. Using Lemma 2, and the solution is given by Equation 3. 

Appendix B 

Lemma 3 (Change in Order of Summation): Given a probability mass function, /(t), 1 < t < 
T of a discrete random variable, and positive real numbers (pi, . . . , pt), ol and /3, 



T+l 



i=l i=j 



i=l 

where C{x) = log(l + x), F{j) = J^Ui fii), 1 < j < T and F(0) = 0, F(T + 1) = 1. 
Proof: Let us first consider the first term of the left hand side with a = 1, 

T+l t-1 

J2f(^)J2<P^) =/(2)c(pi) + /(3)[c(pi) + c(p2)] + . . . + /(T + l)[c(pi) + . . . + c{pt)] 
3=1 i=i 

T T+l 

=c(pi)[/(2) + . . . + /(T + 1)] + . . . + c(pT)/(r + 1) = E <P^) E /(^^ 

i=l j=i+l 
T+l j 

From the definition of the probability distribution function, 1 — F{j) — ^ f{i) — ^f{i) = 

i=l i=l 

T+l T+l t-1 T T+l T T+l T 

E /(i). Hence, E /(j) Ec(a) = E ^<P.) E /(j) = E(1-^W)c(a). Similarly, E /(j)Ec(a) 

i=j+l j=l i=l i=l j=i+l i=l j=l i=t 

T 

E F{t)c{pi), which completes the proof. ■ 

i=l 

Appendix C 

Proof of Theorem 2: We will drop the subscript on s, s to reduce clutter. The Lagrangian 
for the optimization problem in Theorem 2 can be written as. 

Hp, Ai, A2) = R2{p) - Ai5i(p) - ^ \2stg2st{p) - E hstgsstip) - ^ MstPsm - ^ >^bstgsst- 

s,t s,t s,t s,t 
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The partial derivatives of the Lagrangian is given by, 

^ = - V n(s)ms) ( + 

Spsst ^ ^ ^"^^ ' ^ V 1 + Psm + Psst (1 + INR2 + psm + (1 + INR2 + Psst) 
- Ai ^ 7r(s)p(s|s) - Aast - >^zst - Kst- 

s 

The first order necessary conditions (the complementarity conditions are excluded for lack of 
space) are given by the following 12T + 1 equations, = 0, = 0,gi{p) > 0,g2st > 
0, g3,t > 0, and p^m, PsSt > 0, for s e {0, 1}, t e [1, T]. If |/ii2p(INRgap + 1) < 1, then p,st = 0. 
Hence g2st, ^2st are redundant, and by complementarity X5st — 0. Similarly, if < psm < INRc 
by complementarity X^st — and Xist — 0. As the objective is an increasing function in each 
PsNu the average power constraint is met with equality, i.e. Ai > 0. For |/ii2p(l + SNRgap) < 1 
and < psst < we have 

For a given s independent of any t, the right hand side of the above equation is constant. 
Rewriting the left hand side for s — 0, 

.(0)(1 - P.) + —Ml ] + .(i)p^ (Jl^ + ^^(^) ) . const. 

^ ^Vl + PoATt l + INR2 + poivty \l + Pom l+INR2 + poJvJ 

■ ^ ma-PF){l-F{t))+n{l)PMF{t) ^ n{0){l-PF)F{t)+n{l)PM{l-F{t)) ^ ^^^^^ 
1 + Pom 1 + INR2 + Pom 

G(t) H{t) 

I.e. — \ — = const.. 

1 + Pom 1 + INR2 + Pom 

where G{t) = 7r{0){l-PF){l-F{t))+7r{l)PMF{t) and H{t) = 7r(0)(l-Pi.)F(i)+7r(l)PM(l- 

F(t)). Rewriting the equation again, 

G{t) , H{t) _ G{t + l) ^ H{t+1) ^ 



1 + Pom l + INR2+po7vt 1 + Poivt l + INRa + poivt 

" V ' ^ V- 

At Bt 



1 + Poivt 1 + INR2 + PoiVt 

_ F(t,)) ( : 

Pom 1 + INR2 + po7vt. 



(7r(0)(l - Pf) - n{l)PMmt + 1) - F{t)) 
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To make the right hand side Aj+i, we have to remove Ct and replace poivt by poNit+i) in ^t- 
Since C( > when 7r(0)(l — Pp) > n{l)PM, and Bt is an decreasing function of poNt, we have 
Pom ^ PoN{t+i)- Equality holds when Fxit) = FT{t + 1). As this is true for any t G [1,7"], 

Pom — PoN2 ^ • • • ^ Pont- 

For a given s, say s = 0, if there is a i such that Pqat* — 0' t^en we will now prove by 
contradiction that Pom+i — ■ ■ ■ — Pont — 0- Let us assume that Pom+i = p > 0. Define 
R{y,z) = R2iPoNi, ■ ■ ■ , Pom = y,PoNt+i = ^, Poatt, Piati, Piatt)- Note -R2(0,p) < 
i?2(p, 0) for any t e [^,T] and any p > which satisfies all the constraints. 



R^ip, 0) - R^iO, p) =(1 - F{t))C{p) + F(i)C (^^p^ 

-(l-F(t + l))C(p) + F(. + l)c(^^) 



=(F(t + 1) - F(t)) ( C(p) - C ( ) ) > 0, 



1 +INR2, 

since INR2 > and F{t + 1) > F{t). Hence if pom — then pom+i — and similarly for 
t + 2, . . . , T. We can prove the other parts of the theorem in the exact same way. For all previous 
time slots, the ordering follows the same order, 

• If PoNt=o then p^^^ > plj^^ > Pmt-i > Poat* = ■ ■ ■ = Poatt = 0- 
And due to the same reasons, 

• If PiATt = then = pt^2 = ■ ■ ■ = P*im-i = < p^jvt < ■ ■ ■ < pInt- 

• If Pom = |/~p then p^^^ = plj^2 = ■ ■ ■ = PoNt-i — — Pmt ^ • • • ^ Pont- 

• If PiAft = then pI^^ > p\^2 > • • • > PiNt-i > PiNt = ■ ■ ■ = Pint = JJ^- 

Note, for the case when SICc > then psst > 0. From Theorem 4, the value of psst is fixed. 
Hence, we can remove it from our problem statement along with g2st and X2st- The part 
of the Lagrangian follows in the same lines. 

Appendix D 

Proof of Theorem 3: The encoding and decoding functions for each time-slot is same as 
given in Section II-C. The primary state process {St} is assumed to be an irreducible, aperiodic, 
finite-state homogeneous Markov chain and is independent of the secondary channel's input and 
outputs. 
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Achievability follows from Theorem 1 in [22], when the state is perfectly known. We give an 
outline here. 

1) Achievability: During the i"^ block, with the starting state sq, the encoder chooses the 
symbol of the T codewords of length -K^s^jB for T messages and multiplexes them /3t(so) = 
(pi(so), ■ ■ ■ , Pt{sq)), i — 1 . . . ,B. For large B the capacity of the component channels Cso(^) — 
max ^p(s|so)-?^(-'^; Y\s, so) can be achieved. The final rate is calculated by summing over the 

p(x\sq) s 

T 

T multiplexed codewords, C = J2 ^r+i ^ (^) which concludes the achievability. 

So t=l 

2) Converse: Let W be the message random variable. The capacity can be written as 

(T+1)C = limsup - ^(^o) Yl ^ § limsup - ^ 7r(so) max^ /(VF^; f/*^", ^o, 5^"), 
" ^ so t=i " ^ so Pi^" t=i 

where (a) follows the same steps as in Equation (4) of [22], and the superscript {t)n refers to 
time slot t of block n. Now, 

T T 



t=l t=l 

T n 



T n 

T rt 
t=l i=l 

= EEA#^^/*^i^f\^f), 

where (6) follows from the fact that the state is independent of the message and (c) follows from 
the fact that entropy decreases on conditioning and Yi is independent of other random variables 
when conditioned on Wi and Sq \ Finally, 



" SO ^ ^ t=l «=0 



limsup > —— r > max 



/(XW;FW|5o,5W) 



50 

T 

where (c?) follows the same steps as Equation (8) in [22] which completes the proof. 
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Appendix E 

Proof of Theorem 4: The solution can be rewritten as, 

if SICc < 

p*sst = < SICc if < SICc < min (SNR2, INRc) ■ (8) 

min (SNR2 , INRc) if SICc > max (SNR2 , INRc) 
The first part follows trivially from the definition of the problem. To prove the other parts, 
let the optimal solution to the optimization problem be given by (/t>o>Pi) ^^'^^ that p*gg^ < 
SICc for some t e [1,^] and s e {0, 1}. Let us consider p*sNvP*sSf keeping all other powers 
fixed. The constraints for this pair (except the positivity and complementarity constraints) are 
A + 7r(s)(p*jvt + p*sst) ^ 7'SNR2, plst < SICc, and p*^^ + p*^^ < INRc , where the contribution 
to the average power from all other powers except (p*^^, p*gs^ are lumped into A. The rate can 
be written as, 

R2{p:m. Plst) = R^+^s) \mc{p:m+p:st)+m (c , ] +c{p:st) 



L V ^1 + INR2 + p:5,, 

The contribution due to all other powers to the average rate has been lumped into ^^2"^. Now 
let us take away a small part of the power from p*^^ and add it to p*^^. This has no effect 
on the sum p*^^ + p*^^, hence the constraints remain unchanged. We chose a S > such that 
P*Nt - > and Plst + S< SICc. It is easy to see that R2(p*snv P*sSt) < Mplm - P*sSt + <^)' 

i-e- fl + ^tt^SFv^) (1 + Plst) < fl + ^rr^S^^i—^ (1 + Plst + 8) 



i+inr2+p:5j^ V 1 + 1NR2 + p:s, + (5 

. (1 + INR2 + pl^t + plst){^ + Plst) (1 + INR2 + pl^t + Plst)i^ + Plst + ^) 
1 + INR2 + Plst 1 + INR2 + Plst + ^ 

since INR2 > 0. This perturbation can only go on till either psm reaches its lower Umit, or psst 
reaches the upper limit. We hit the lower limit of Psm first if SICc > max (SNR2, INRc). In 
this case, p*;^^ = and psSt — min (SNR2, INRc) ■ Otherwise, we hit the upper limit psSt first, 
if PsSt = SICc. In either case, the result is independent of the particular value of t. 

Appendix F 

Proof of Theorem 6: The only variable is the superposition fraction a, so the problem is one di- 
mensional. The derivative of the optimization function is given by, ^^^^ = (i+asm5ti+i>«^+«SNR2) ' 
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The slope of the optimization function is always positive, i.e. > 0. Hence, the optimal 



solution is achieved at the upper boundary of the intersection of the constraints on a. which are 

^(l+INRgap)-l 

< a < 1 and a < — ^ . If the right hand side is less than or equal to 0, then a* = 0, 

if the right hand side is more than 1, a* — 1, otherwise, a* — — ^ . For \hi2\ = 1, 

this is same as the rate splitting result for two users as shown in [20]. 

a) If INR2 > ^^^^ySM!^-' ™s ^^^^^ the decodability condition C > 
C (^Yi^—^ ■ Hence, after decoding off the primary, the rate that can be achieved by the 
secondary is R2{a*) = C (SNR2). But this is the single user capacity of the secondary channel 
for the given power constraint when there is no primary interference. Therefore, this scheme of 
using only a Layer 2 codeword is optimal in this case. 

b) If INR2 < Tz^~-' the primary cannot be decoded even if no secondary data is sent i.e. 
C (SNR2) < C ^ 1+^, ^ • So, in this case, the only option is to treat this undecodable signal as 
noise. The effective channel now behaves as a point to point channel with AWGN A/^(0, 1+INR2). 
Hence the capacity of this channel is given by /?2(«*) = C (^-^^^ ■ 

c) If < ^^^2 < ^^X' •• For the rest of the parameter range, the rate is a corner 
point of the 3 user virtual MAC formed by the primary and the two layers of the secondary 
(Figure 5(a)). The fixed primary transceiver converts the effective MAC (from the point of view 
of the secondary receiver) into a straight line and the secondary splits its information into two 
layers to achieve the boundary point. The sum capacity of the effective MAC is achieved by this 
scheme since for any a, C (^^J^gl^) + C (j^^) + C (aSNR2) = C (INR2 + SNR2). 
Additionally a* gives the best rate among all the secondary rates. Hence the layered encoding 
at the secondary transmitter and sequential decoding at the secondary receiver is optimal and 
the capacity is given by R2{a*). 
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